Streamlining Functional XML Processing∗
نویسنده
چکیده
Since an XML document has tree structure, XML transformations are ordinarily defined as recursive functions over the tree. Their direct implementation often causes inefficient memory usage because the input XML tree needs to be completely stored in memory. In contrast, XML stream processing can minimize the memory usage and execution time since it begins to output the transformation result before reading the whole input. However, it is much harder to write the XML transformation program in stream processing style than in functional style because stream processing requires stateful programming. In this paper, we propose a method for automatic derivation of XML stream processor from XML tree transformation written in functional style. We use an extension of macro forest transducers as a model of functional XML processing. Since an XML parser is represented by (infinitary) top-down tree transducer, the automatic derivation of XML stream processor is based on the composition of the top-down tree transducer and the extension of macro forest transducers.
منابع مشابه
fxp{ Processing Structured Documents in SML
Xml, the Extensible Markup Language, is a standard for storing and exchanging hierarchically structured text. Due to Xml's tree-like view of documents, modern functional programming languages such as Sml, with trees as the basic data types, are well-suited for implementing Xml processing software. The basis of all such applications is an Xml parser, serving as a front-end to the Xml syntax. Thi...
متن کاملParallel processing model for XML parsing
In this paper, are presented some development problems and solutions concerning the parallel implementation of an algebraic method for XML data processing. It is in tight connection with modern concepts of the parallel programming. The proposed parallel algorithm first partitions the XML document into chunks and then apply the parallel model to process each chunk of XML tree. In the article are...
متن کاملStreamlining the CERIF XML Data Exchange Format Towards CERIF 2.0
The Common European Research Information Format (CERIF) is an established standard for Current Research Information Systems (CRISs) facing the increasing need for information sharing and exchange. euroCRIS released the first official CERIF XML exchange format in 2007; it followed the structure of the relational data model. Based on experience with the format and consulting with the CRIS communi...
متن کاملPurifying XML Structures
XML has a tree structure, which allows various data expressions. In this work, we show the well-known notion of the functional dependency has an essential role in XML data modeling. The existing proposals of functional dependencies for XML are based on fixed paths, imposing severe structural constraints on XML data. We address this problem by introducing node-based functional dependencies so as...
متن کاملFunctional OO Programming with Triangular Circles
Given the capabilities of OO programming, its maturity and generality, it is a sane expectation to adopt OO as the paradigm of choice for XML processing. To serve that expectation, one would map XML schemas to object models so that XML data can be processed through familiar objects. The first generation of mapping technologies has been somewhat disappointing and the term X/O impedance mismatch ...
متن کامل